Zipf's law in short-time timbral codings of speech, music, and environmental sound signals

ثبت نشده
چکیده

Timbre is a key perceptual feature that allows to discriminate between different sounds. Timbral sensations are highly dependent on the temporal evolution of the power spectrum of an audio signal. In order to quantitatively characterize such sensations, the shape of the power spectrum has to be encoded in a way that preserves certain physical and perceptual properties. Therefore, it is common practice to encode shorttime power spectra using psychoacoustical frequency scales. In this paper, we study and characterize the statistical properties of such encodings, here called timbral code-words. In particular, we report on rank-frequency distributions of timbral code-words extracted from 740 hours of audio coming from disparate sources such as speech, music, and environmental sounds. Analogously to text corpora, we find a heavy-tailed, Zipfian distribution with exponent close to one. Importantly, this distribution is found independently of different encoding decisions and regardless of the audio source. Further analysis on the intrinsic characteristics of most and least frequent code-words reveals that the most frequent code-words tend to have a more homogeneous structure. We also find that speech and music databases have distinctive code-words while, in the case of the environmental sounds, this database-specific code-words are not present. Finally, we find that a Yule-Simon process with memory provides a reasonable quantitative approximation for our data, suggesting the existence of a common simple generative mechanism for all considered sound sources. Our results provide new evidence towards understanding both sound generation and perception processes and, at the same time, they suggest a potential path to enhance current audio-based technological applications by taking advantage of knowledge about the found distribution. URL http://dx.plos.org/10.1371/journal.pone.0033993 [5] Source URL: https://www.iiia.csic.es/en/node/53956 Links [1] https://www.iiia.csic.es/en/bibliography?f[author]=185 [2] https://www.iiia.csic.es/en/staff/joan-serr%C3%A0 [3] https://www.iiia.csic.es/en/bibliography?f[author]=186 [4] https://www.iiia.csic.es/en/bibliography?f[author]=187 [5] http://dx.plos.org/10.1371/journal.pone.0033993

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Zipf's Law in Short-Time Timbral Codings of Speech, Music, and Environmental Sound Signals

Timbre is a key perceptual feature that allows discrimination between different sounds. Timbral sensations are highly dependent on the temporal evolution of the power spectrum of an audio signal. In order to quantitatively characterize such sensations, the shape of the power spectrum has to be encoded in a way that preserves certain physical and perceptual properties. Therefore, it is common pr...

متن کامل

Speech and Music: Acoustics, Signals and the Relation between Them

The codings of speech and music are different and in some ways complementary. The voice operates on acoustical principles distinctly different from those of (other) musical instruments. This paper explains how and poses the question: are the different codings the result of the different acoustics, or vice versa? What if the instruments came first? This paper develops the conjecture that the pit...

متن کامل

کاربرد الگوریتم جداسازی کور منابع در جداسازی سیگنال‌های گفتار و موسیقی

In this paper, the application of the Independent Component Analysis In this paper, the application of the Independent Component Analysis technique in speech-music separation is discussed. The separation algorithm is in the time domain. It needs the score function estimation to minimize the mutual information. For estimating score function, sufficient samples of the mixed (speech-music) signals...

متن کامل

شناسایی خودکار سبک موسیقی

Nowadays, automatic analysis of music signals has gained a considerable importance due to the growing amount of music data found on the Web. Music genre classification is one of the interesting research areas in music information retrieval systems. In this paper several techniques were implemented and evaluated for music genre classification including feature extraction, feature selection and m...

متن کامل

Self-Organizing Topological Timbre Design Methodology Using a Kohonen Neural Network

Generating sounds for music composition with the desired timbral characteristics has been a challenge ever since the dawn of electroacoustic music. Timbre is a remarkably complex phenomenon that has puzzled researchers for a long time. Actually, the nature of musical signals is not fully understood yet. In this paper, we present a sound synthesis technique that uses Kohonen’s one-dimensional se...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017